Tibetan functional chunk recognition using statistical based method
نویسندگان
چکیده
منابع مشابه
Tibetan Syllable-Based Functional Chunk Boundary Identification
Tibetan syntactic functional chunk parsing is aimed at identifying syntactic constituents of Tibetan sentences. In this paper, based on the Tibetan syntactic functional chunk description system, we propose a method which puts syllables in groups instead of word segmentation and tagging and use the Conditional Random Fields (CRFs) to identify the functional chunk boundary of a sentence. Accordin...
متن کاملChunk-Based Statistical Translation
This paper describes an alternative translation model based on a text chunk under the framework of statistical machine translation. The translation model suggested here first performs chunking. Then, each word in a chunk is translated. Finally, translated chunks are reordered. Under this scenario of translation modeling, we have experimented on a broadcoverage Japanese-English traveling corpus ...
متن کاملOnline unconstrained handwritten Tibetan character recognition using statistical recognition
K E Y W O R D S online handwritten Tibetan character recognition, de-noising, pre-processing, three-stage classification This is a contribution from Himalayan Linguistics, Vol. 15(1): 31–40. ISSN 1544-7502 © 2016. All rights reserved. This Portable Document Format (PDF) file may not be altered in any way. Tables of contents, abstracts, and submission guidelines are available at escholarship.org...
متن کاملChunk Tagger - Statistical Recognition of Noun Phrases
We describe a stochastic approach to partial parsing, i.e., the recognition of syntactic structures of limited depth. The technique utilises Markov Models, but goes beyond usual bracketing approaches, since it is capable of recognising not only the boundaries, but also the internal structure and syntactic category of simple as well as complex NP’s, PP’s, AP’s and adverbials. We compare tagging ...
متن کاملNamed Entity Recognition using an HMM-based Chunk Tagger
This paper proposes a Hidden Markov Model (HMM) and an HMM-based chunk tagger, from which a named entity (NE) recognition (NER) system is built to recognize and classify names, times and numerical quantities. Through the HMM, our system is able to apply and integrate four types of internal and external evidences: 1) simple deterministic internal feature of the words, such as capitalization and ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Himalayan Linguistics
سال: 2016
ISSN: 1544-7502
DOI: 10.5070/h915130105